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ABSTRACT 

Context. Helioseismology has enabled us to better understand the solar interior, while also allowing us to better constrain solar 
models. But now is a tremendous epoch for asteroseismology as space missions dedicated to studying stellar oscillations have been 
launched within the last years (MOST and CoRoT). CoRoT has already proved valuable results for many types of stars, while Kepler, 
which was launched in March 2009, will provide us with a huge number of seismic data very soon. This is an opportunity to better 
constrain stellar models and to finally understand stellar structure and evolution. 

Aims. The goal of this research work is to estimate the global parameters of any solar-like oscillating target in an automatic manner. 
We want to determine the global parameters of the acoustic modes (large separation, range of excited pressure modes, maximum 
amplitude, and its corresponding frequency), retrieve the surface rotation period of the star and use these results to estimate the global 
parameters of the star (radius and mass). 

/Wefhods. To prepare for the arrival and the analysis of hundreds of solar-like oscillating stars, we have developed a robust and 
automatic pipeline, which was partially adapted from helioseismic methods. The pipeline consists of data analysis techniques, such 
as Fast Fourier Transform, wavelets, autocorrelation, as well as the application of minimisation algorithms for stellar-modelling. 
Results. We apply our pipeline to some simulated lightcurves from the asteroFLAG team and the Aarhus-asteroFLAG simulator, and 
obtain results that are consistent with the input data to the simulations. Our strategy gives correct results for stars with magnitudes 
below 11 with only a few 10% of bad determinations among the reliable results. We then apply the pipeline to the Sun and three 
CoRoT targets. In particular we determine the large separation and radius of the Sun, HD49933, HD181906, and HD181420. 

Key words. Methods: data analysis ~ Stars: oscillations 



1. Introduction 

During the past 30 years, helioseismology has proven its abil- 
ity to study the structure and dynamics of the solar interior in a 
stratified way. These seismic tools allow us to infer some phys- 
ical quantities as a function of the radius: the sound-speed (e.g. 
|Basu et al.|[T997 



the density (e.g. Turck-Chieze et al. 2001 



the ro tation profile in the convectiv e zone (e.g. [Thompson et a 
[1996 1, or in the radiative zone (e.g. Garcia et al.|2008[ ) are some 
well known examples. But helioseismology has also provided 
other quantities such as the position of the base of the convec- 
tion zone ( Christensen-Dalsgaard et al.|1985 Ballot et al. 2004[i 



and the photospheric Helium abundances ( Vorontsov et al. 1991 
|Basu & Antia '1995'). These observational constraints improved 
the standard solar models significantly. For instance, with the so- 
lar sound speed (e. g. Math ur et al.|2007 and references therein) 
the agreement between theory and seismic observations is better 
than a few parts per hundred (using the recent revision of so- 
lar abundances from Asplund et al. 2005 ) or even a few parts 
per thousand (with the old solar abundances of [Grevesse et al.| 
[T993] l. 



The Sun is a fundamental calibrator of stellar evolution (e.g. 
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Christensen-Dalsgaard 2009|, but observations of many other 
stars - covering the HR diagram through asteroseismology - will 
allow testing stellar evolution and dynamo theories under many 
diflFerent conditions ( Chaplin et al._2008b ). However, compared 
to the Sun, less information will be available from asteroseis- 
mology as only low-degree modes (those with a small number 
of nodal lines at the surface of the star) will be accessible due 
to the absence of spatial resolution in the observations. This re- 
duces the accuracy and spatial resolution of the inversions. On 
the other hand, some stars offer the possibility of observing grav- 
ity modes, thus giving access to the structure and dynamics of 
their radiative interiors (e. g. [Mathur et al.|2008a[ l. 

Nowadays, we are approaching a crucial period of astero- 
seismic observations. Several campaigns of ground-based obser- 
vations have been organised (e. g. Bedding & Kjeldsen 2003} 
Ar entoft et al.||2008] ). Besides, several space missions have al- 
ready been observing - for more than 3 years - solar-like os- 
cillating stars (whose acoustic oscillations are excited stochas- 
tically by convection): the American satellite WIRE (Wide- 
Field Infrared Explorer, Bruntt et al.|2005) , the Canadian satel- 
lite MOST (Microvariability and Oscillations of STars, Walker 
[et al.|2003] l, and the French-European-Brazilian mission CoRoT 
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(Convection, Rotation and planetary Transits, see e.g. |Michel| 
et al. 2008[ l. With the nominal observation cycle of CoRoT 



(changing the pointing field every six months), only a very few 
solar-like oscillating stars are observed each year (see for exam- 



ple: [Appourchaux et al.||2008 Garcia et al.|2009 Barban et al 



2009[ l, allowing us to analyse each individual target very care- 
fully. The launch of Kepler ( [Borucki et aL][2009l ) on March 7, 
2009, heralds a new era for asteroseismology, in that it will in- 
crease the number of solar-like stars observed for asteroseismol- 
ogy by two orders of magnitude. Kepler, which will search for 
habitable exoplanets, also has a wide asteroseismic programme, 
the Kepler Asteroseismology Investigation (KAI, Christensen- 
[Dalsgaard et al. 2008 ). Because it will observe a large number 
of stars for asteroseismology, Kepler presents many challenges 
for the efficient analysis of the data. The nominal Kepler lifetime 
is expected to be 3.5 years allowing very accurate asteroseismic 
studies of the same field all or the entire mission and the pos- 
sibility of measuring stellar cycles ( Metcalfe et al.|2007 Karoff 
et al.|2009» . 

Concerning the asteroseismic part, a survey phase will take 
place over the first 10 months of Kepler nominal operations (e. g. 
Chaplin et al.|2010| The main goal of this phase is to select the 



best stars that will be targeted during the rest of the nominal 
mission for, at least, two and a half years. Each star will be ob- 
served for one month at a time, putting constraints on the way 
how to extract the seismic parameters. For brighter targets we 
will be able to get individual frequencies, but for the majority of 
stars (fainter ones) this will not be the case and one must rely on 
extracting mean global parameters (see for example the analy- 
sis of a 26 days of CoRoT observations on a solar-like target by 
IMosser et al.p009l ). 

About a thousand stars are going to be observed during 
the survey phase at a one-minute cadence. Consequently, peo- 
ple working in the Kepler Asteroseismic Science Consortium 
(KASC) will have to analyse several hundreds of stars every 
month. This is the reason why we need to change our single-star 
analysis approach to an automatized pipeline that will provide 
global-seismic parameters as well as other first estimations of 
structural parameters like their radius or mass. Some teams have 
developed their own pipeline (Hekker et al. 2009a ; Huber et al.| 



|2009) within the AsteroFLAG group to be ready to analyse the 
Kepler data. 

The objective of this paper is to describe the pipeline we have 
developed to achieve this goal. The global parameters of the os- 
cillations that are some of the output of our pipeline can be used 



by modelers to constrain their stellar models (e. g. Metcalfe et al. 
|2009i|Stello et al.|2009b| l 

Sect. 2 describes the goals and methods of the different pack- 
ages of our pipeline dedicated to the determination of the p-mode 
global parameters. Then in Sect. 3, we explain how we retrieve 
the radius of the stars from these parameters. To illustrate that 
our pipeline works properly as well as its limits, we applied it to 
some simulated data by asteroFLAG team (Chaplin et al. [20081) 1 
and by a combined Aarhus/asteroFLAG simulator, to the Sun, 
and to three of the CoRoT targets in Sect. 4. Finally, we discuss 
the results of our pipeline and the strategy adopted. 

2. Pipeline package description 

The automatic pipeline that we have developed is called A2Z. It 
is divided into three main parts (see Fig. [T] for a detailed view 
of the workflow diagram). The first one is dedicated to the study 
of the global parameters of solar-like oscillating stars finishing 
by computing a first estimation of the mass and the radius of 



the star deduced from empirical scaling laws. The second part 
corresponds to the extraction of p-mode properties through the 
fitting of individual modes. The third part uses a pre-calculated 
grid of evolutionary models where we give as input previously 
determined parameters such as the large separation and the fre- 
quency range of the p-mode excess to obtain a better estimation 
of the radius and mass. 



)BAI, I'AIUMETERS 



FITTINti CHAR,\CTERISATION 







Frequencies 
of modes 



Parameters 
Pi (Ai, r„ 



MODEL t;R!D 




Fig. 1. Workflow diagram of the A2Z pipeline to extract the 
global parameters of stellar oscillations and the characteristics 
of the modes and to determine of the mass and radius through 
the comparison with a pre-calculated grid of models. The rectan- 
gles represent the code packages and the input/output are in the 
circles (each output has its corresponding error bar). The dashed 
lines correspond to the packages that are not described in this 
paper 



The starting point of our A2Z pipeline is either the time series 
or the power spectrum density (PSD) expressed in ppm^/yuHz. A 
Lomb-Scargle algorithm is used depending on the regularity of 
the sampling rate of the time series. 

The package APR (Sect. 2.1, average photospheric rotation) 
gives an estimation of the surface rotation period of the star 
analysing either the low-frequency part of the PSD or looking for 
the time-frequency signature of this periodicity using wavelets. 

PMRS (Sect. 2.2, p-mode range search), uses two different 
methods to estimate the mean large separation ((Av)) and gives 
the range of the p-mode excess power {fmm and /max)- These lat- 
ter values are then used to fit the background in the PSD with the 
package BF (Sect. 2.3, background fitting). Then, the third pack- 
age, BALO (Sect. 2.4, bolometric amplitude per ^"=0), calculates 
the amplitude envelope and gives the maximum bolometric am- 
plitude per radial mode (A^ax) as well as the corresponding fre- 
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quency (Vmax)- Finally, with these asteroseismic parameters, we 
can estimate the radius (R') and the mass (M') of the stars using 
empirical scaling laws with the package FDRM (Sect. 2.5, First 
Determination of the Radius and the Mass). 

For the package dedicated to the fitting of the modes, we 
need a table with the guesses of the mode frequencies. The pack- 
age Guessing Parameters (Sect. 2.6) uses the global parame- 
ters of the modes obtained previously to calculate a table of the 
guesses for the frequencies (vg_,) and the amplitudes (Agj) (where 
g is used for the guesses and / is the mode index) as well as a first 
estimation of the variation of the large separation with frequency 
(Ay(y)). 

Another package RaDex (Sect. 3, radius extractor) based on 
a grid of stellar models also estimates the radius of the stars using 
(Av), /min, and /nax- All the outputs of the different boxes are 
calculated with their uncertainties, e. 

If the signal-to-noise ratio (SNR) is high enough, the global 
parameters and the guesses are used by the maximum-likelihood 
Global Fitting code to infer the characteristics of each mode: 
the central frequency v,, the amplitude A,, the width F,, and the 
rotational splittings dv,. This global fitting code will be used only 
on a few stars during the survey phase of Kepler However, the 
characterisation of individual p modes is out of the scope of this 
paper. It has already been tested on the Sun a large number of 
times and applied to several CoRoT targets (e. g. Appourchaux 
let al.|2008, Barban et al. 20091 [Garcia etal.|2009l l. 

In the following sections, we describe the different packages 
and apply them to solar-like stars simulated under typical astero- 
seismic conditions. In some packages, we use different methods 
to cross check the results. 



2.1. Average Photospheric Rotation 

One of the first things that we want to study is the surface ro- 
tation period of the star. Thanks to seismology, we can inde- 
pendently determined the inclination from the relative amplitude 
of mode components inside multiplets ( Gizon & Solanki|2003 1. 
Unfortunately, the seismic estimates of the rotational splitting 
and of the inclination / -simultaneously obtained by fitting ob- 
served oscillation spectra- are strongly correlated for many of 
solar-like stars due to their short mode lifetime (for more details 
and discussion, see |Ballot et al.|2006||2008) . Thus, an indepen- 
dent determination of one of these two parameters (rotation or 
inclination angle) could help during the fitting procedure of the 
p-mode parameters to establish, for example, some a priori con- 
straints (e.g.jBenomar 2008 Benomar et al. 2009"). However, it is 
important to remember that the rotational splitting of the mode is 
an average value of rotation inside the cavity in which the mode 
propagates and can be different from the surface rotation if there 
is strong differential rotation in the interior of the star. 

To infer the rotation period of the stellar surface several 
methods can be used: studying the PSD or using the wavelets. 
The classical method consists in analysing the PSD at very low 
frequency. We look at the highest peaks below 5 fjHz (a value 
that can be changed to take into account more rapid rotators). 
However, with this method it is difficult to distinguish between 
the fundamental peak of the stellar surface rotation period and 
its first harmonics. 

The other method that we use in this work consists of ap- 
plying a time-frequency analysis using the wavelets ( |Torrence"& 
|Compo|1998| l. Before applying the wavelet tools, we reduce the 
number of points by rebinning the data by a boxcar function to 
speed up the calculation. We also filter the data to have only the 
low-frequency region. Then we use the Morlet wavelet, which is 



the product of a Gaussian and a sinusoid, thus a finite pattern and 
change the scale of the wavelet, i.e. its frequency (or period). By 
sliding it along the time series we obtain the wavelet power spec- 
trum (WPS), which is the result of the correlation between the 
wavelet and the data. This method has the advantage of lifting 
the uncertainty on the rotation period (see Sect. 4.2). This tool 
has also been extensively used to study solar and stellar activity 
( Mathur et al.|2008 b ). Besides, a very conservative approach has 
been followed by defining the cone of influence (COI) to deter- 
mine the limit of a reliable result: a periodicity should be seen at 
least four times to have more confidence on the value. 



An example of the WPS is plotted on Fig. 13 using solar data 



We calculate the sum of the WPS along time for each period (or 
frequency) of the wavelet. The result, which is the global wavelet 
power spectrum (GWPS), is represented as a function of the pe- 
riod (or frequency of the wavelet) (see Fig 13 right panel). We 
overplotted as well the threshold for a confidence level of 95% 
(chosen for our study). Any peak that is above this threshold has 
more than 95% probability not to be due to noise. The surface 
rotation period of the star is easily identified with this confidence 
level. 



2.2. P-Mode Range Search and large separation estimation 

We study the stellar oscillations spectrum by searching for the 
frequency range where the p modes present most of their power 
in the PSD. This search begins with the estimation of the mean 
large separation, (Av). 

It is important to have two independent methods to obtain 
the spacing, to cross-check the results, and to be able to have 
reliable results. 



2.2.1. Method 1 

As the modes are equally spaced in frequency in the PSD, we 
compute the fast Fourier transform of a slice between 100 and 
10000 yuHz in the PSD, (called hereafter PS2) and look for the 
highest peak (see Fig.|2]|. The large separation is defined as: 



Av = v„+i/ - v„f. 



(1) 



where v is the frequency, { the degree, and « the radial order of a 
mode. Between two consecutive orders of a given degree { of a 
mode, we have the £+1 mode, which is approximately positioned 
at a distance Av/2; therefore the periodicity of Av/2 occurs more 
often in the PSD. Thus, we assume that the highest peak in the 
PS2 will correspond to half of the large separation. 

Fig. |2] shows an example of a PS2 for a simulated star We 
find the highest peak at 34.5 yuHz, which gives (Av) = 69 //Hz. 
The signal-to-noise ratio of the amplitude of this peak is 6.22 cr. 
Then, in the PSD, we take a box of 600 yuHz starting at a fre- 
quency of 100 fiHz, calculate the PS2 normalised by cr, where 
cr is the standard deviation of the PS2 and we look for the peak 
around (Ay)/2 . For this search, we look for the highest peak in 
the PS2 in the range (Av)/2 + 10 //Hz. We repeat this by shifting 
the box in the PSD by 60 //Hz until 10000 yuHz. Then, we plot 
the power normalised by cr of the PS2 of the highest peak (or 
maximum relative power) found in this range as a function of 
the central frequency of each box. 

An example of this kind of plot is in Fig. [3] The horizon- 
tal error bars correspond to the size of the box of 600 /(Hz. To 
put some statistical constraints on this search, we calculate the 
threshold, s, corresponding to the probability of finding a peak 
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in the PS2 using the formula ( [Chaphn et al.|2002t [Gabriel et al.| 
[2002) 1: 



s = ln(AT)-lnP, 



(2) 



where A is the size of the box, T is the length of the time-series, 
and P is the probability. In Fig. |3] we have plotted the thresh- 
old for a probability of 95%. So we can say that the frequency 
range where most of the p-mode power is found goes from 937 
juHz to 2474 fjHz, which corresponds to the boxes where the rel- 
ative power of the peak is above the given threshold. Finally, we 
calculate the mean value of the (Av) found in each independent 
box between /n,in and /max as well as the dispersion giving us the 
uncertainty on our determination of (Av). 



1.2x10 




2.0x10 



50 100 
Frequency (/xHz) 



150 



Fig. 2. Power spectrum of the power spectrum (PS2) between 
100 and 10000 fiHz as a function of the frequency for one simu- 
lated star (Pancho) and normalised by cr of the PS2. 



But as we can have very low SNR stars, we allow the thresh- 
old to go down to as low as a probability of 70% if the higher 
probabilities are not found. 

When we do not find signal above 70% of confidence level, 
we change the size of the box of 600 //Hz and try two different 
values, 900 and 300 //Hz, and start the search again. We use a 
flag (box) to record the size of the box. 

We have found in some simulated stars that two dissociated 
regions can be above the threshold, thus two bumps: the correct 
one and another one due to noise, which gives us a very wide 
range for the p-mode region. Thus, to determine the correct re- 
gion, we use again a box of 900 or 300 //Hz and if we have only 
one bump in the new analysis, we select the common bump of 
both analyses. By doing so, we have improved a few results. But 
if we still have two bumps, we use a flag (dblpk) to inform us 
about the presence of this double bump. 

With the simulated data, we have noticed that the highest 
peak was not always the one corresponding to half of the large 
separation. So the first estimation of (Av) might not be correct 
and might be due to noise. That is the reason why, if we do not 
find any signal above 70% of confidence level in the Maximum 
relative power, we look for the second highest peak and the third 
highest peak in the PS2 and do the whole search again. 
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Fig. 3. Typical maximum relative power of the highest peak in 
the PS2 around half of the large separation, as a function of the 
central frequency of the sliding box taken in the PSD for one 
simulated star (Pancho). The dashed line represents the threshold 
corresponding to a 95% confidence level. The error bars repre- 
sent the size of the boxes taken in the PSD (600 //Hz in this case). 
Thus, the range of the p-mode power excess is [940, 2440] //Hz. 



We also have a first estimation of v^ax, the frequency of 
maximum amplitude, by taking the frequency of the highest 
Maximum relative power 

Once we have determined (Av) and we have an estimation 
of Vmax, we verify our results by using scaling laws ( |Stello et al. 



2009a I, which establish the following relation: 



<^^> = ^^^^(305o) • 



(3) 



If the (Av) found with our method does not fall within an uncer- 
tainty of 20% around the theoretical value, we use a flag. 

As an alternative, we can run this code a second time with 
an estimation of (Av) as an input and by constraining our search 
for a peak in the PS2 around this theoretical value +15 //Hz. 

The output of this part of the pipeline is the value of the large 
separation and the frequency range of the p-mode bump with a 
given confidence level. 

2.2.2. Method 2 

The second method to look for the large separation is based 
on the estimation of some parameters with spectroscopic obser- 
vations. From the fundamental stellar parameters: parallax, ef- 
fective temperature, apparent magnitude, and log g or as in the 
Kepler stars: effective temperature, log g, and radius, given in the 
KIC (Kepler Input Catalog, [Latham et al. 2005 1, we use the scal- 
ing laws from Kjeldsen & Bedding '{ 1995 1 to estimate the range 
in frequency for the p-mode excess power and the large sepa- 
ration. With these initial values, we look for (Av) in the power 
spectrum (PS) of the data as well as in some average spectra by 
taking several subseries in the data. We can choose the number 
of average spectra. 

The search is done by computing the PS of a short slice (typ- 
ically 900 //Hz) of the PS where the p-mode excess power ap- 
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pears ( Regulo & Roca Cortes|2002 1. The search for the spacing 
is done iteratively by trying different values in a 50 //Hz -range 
around the estimated spacing and with a step of 0.01 /iHz. We 
try to find if there is any signal above 1.5 times the rms of the 
power spectrum at any of the bins spaced by (Av). To evaluate 
the significance of the peaks found and to avoid binning effects, 
this procedure is repeated 50 times and by continuously shorten- 
ing the length of the 900 //Hz-slice, down to Ix(Av). The coin- 
cidences of spacing among the peaks found in the 50 trials are 
then registered (See Fig.[4|. 




60 70 80 

Spacing (juHz) 



Fig. 4, Result for the search for the spacing in one month of data 
for one of the simulated stars (Pancho). The highest peak corre- 
sponds to the large separation present in the data. 



From this search we obtain for the whole spectrum as well as 
for the averaged spectra a possible value of (Av). We also fit the 
histogram of coincidences with a Gaussian profile. The width of 
the latter gives the error bar (cr) associated to the value obtained 
for the spacing. Then, we take the mean value of all the (Av) 
found in each data series, typically 5 for a month of data. The 
values that are 3 cr above the mean value are rejected and a new 
mean value with its error is estimated. This process is repeated 3 
times and the result with its error is taken as the large separation 
for the analysed data set. However, if the data have a poor SNR 
or if the fundamental stellar parameters are badly determined, it 
can lead to the wrong spacing, and it is very difficult to identify 
this result as incorrect. 

In the possible case for which the fundamental stellar pa- 
rameters are ill-determined, we have the possibility to run all 
the pipeline a second time using as an estimation for the p-mode 
frequency range and for the large separation, the results obtained 
bhndly by Method 1 described above. 

2.2.3. Cross-checking the two methods 

To cross-check the large separations obtained with the two meth- 
ods, we follow three steps: 

First step: We run both methods. Method 1 based on the blind 
estimation of the initial parameters, and Method 2, with the ini- 
tial parameters obtained from the scaling laws. We obtain some 



results that are compatible between both methods and some that 
are different. 

To cover the possibility to have selected a wrong region for 
the p-mode range in the blind way of finding it in Method 1 or 
the possibility to have a bad determination of the fundamental 
parameters in Method 2, we implement two more steps. 

Second step: Method 2 is run using the blind estimations 
from Method 1 and the new results of Method 2 are compared 
with the results from Method 1 . 

Third step: Method 1 is run using the scaling law estimations 
from Method 2 and the new results of Method 1 are compared 
with the results from Method 2. 

As explained in Sect. 2.2.1, we calculate the threshold for 
detecting a peak in the PS2 for a given confidence level. We 
say that we are very confident when we are above a threshold 
of 85% (confidence level of 2), less confident between 70 and 
85% (confidence level of 1), and not confident at all below 70% 
(confidence level of 0). 

To select the most reUable results, we dismiss the step in 
which we obtain the highest number of common stars with a 
confidence level of 0. Among the 2 remaining steps in which we 
remove the stars with the confidence level of 0, we choose the 
step where we have the highest number of coincidences. 

As a fourth step, to try to recover the spacing of more stars, 
we look at the common stars in the two comparisons in which we 
have obtained the lower number of stars with zero confidence 
level. For example if among 300 stars we have 170 stars with 
common results with step 1 (with a confidence level of 1 and 2) 
and in step 2 there are 195 of them, we check if the first 170 
common stars from step 1 are included in the 195 from step 2. If 
only 150 stars are the same between step 1 and 2, we accept the 
other 20 as correct results and we add them to the 195 obtained 
from step 2. Finally, we will have 215 common stars. 

2.3. Background Fitting 

The background of the spectrum is modeled with three compo- 
nents: 

1. white noise to model the photon noise (W); 

2. Harvey's model which reproduces the convective contribu- 
tion to the background ( Harvey|1985 i; 

3. a power law modelling activity effects and low-frequency 
trends. 



Our model has then 6 free parameters: 



B(v) = W -h 



1 + (2y/vcr 



+ av 



(4) 



All of the parameters have to be positive, which is a constrain 
in the fitting code. The model is fitted to the spectrum over 
the domain [vi,/min] U [/max,Vcut] using a classical maximum- 
likelihood estimator. 

The parameters /min and /max are the bounds of the interval 
where p-mode power excess has been detected and returned as 
an output of the package PMRS. v^ut is the cut-off frequency and 
vi the lowest considered frequency (in practice, we consider vi = 
1 /iHz). In other words, we simultaneously fit the full spectrum 
after having excluded the p-mode region. 

We use as an initial guess for W the average of the power 
spectrum around the cut-off frequency, where the photon noise is 
expected to dominate the other components. To find guesses for 
a and b, we perform a log-log regression in the low-frequency 
domain. The only input guess requested by this package is v^.. 
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The value of the spectrum around the frequency Vc is then used 
to extrapolate a guess for A. We have typically used values in 
the range 100-1000 //Hz for the guess of v^. . By using this set of 
guesses, a first fit is performed where a is kept fixed with a value 
of 2. The fitted values are then used as guesses for a second fit 
where all of the parameters have been let free. 

We have noticed with our benchmark of tests that this routine 
is robust and stable. The final results depend indeed only weakly 
on the exact value used for the input guess of Vc- 

2.4. Characterisation of the P-Mode Envelope (Bolometric 
Amplitude per C=0) 

Once we have fit the background with the package BF, we sub- 
tract it from the PSD as we need to estimate the amplitude of 
the modes without the contribution of the stellar noise. Then, we 
smooth the new PSD using a sliding window of q x(Av), where 
2 or 3. We obtain the envelope of the power spectrum. We 
fit the result with a Gaussian function, which gives the maximum 
power, P„iax in ppm^//Hz ' and the frequency position, Vmax- We 
also recover the error estimate from this fitting, which gives us 
the standard deviation related to both coefficient, P^ax and v^ax- 
To convert this power into amplitude, Ampl,„ax, we use the fol- 
lowing formula as we have a single-sided PSD: 
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Fig. 5. Typical bolometric amplitude of the mode ^ = as a func- 
tion of the frequency for one simulated star. 



Amp/max = V2 X Anax X ^ X <Av>. 



(5) 



Then, we want to calculate the rms maximum amplitude per 
radial mode or the bolometric amplitude, A^ax- As we use the 
method described in Kjelds en et al.| (j2008|, the formula for the 
amplitude is the following: 



Amp/max / ^J2XqXC = V(^'max X <Av))/c, 



(6) 



where c is related to the spatial response to the observations of 
the modes of degree { - Q to 4 and to the type of observations 
(intensity or velocity). For the simulated stars, we took c=3.16 
as given in Kjeldsen et al.| (|2008||. However, the correction to 



obtain the bolometric amplitude per radial mode depends both 
on the instrument and the star (Tg^ff), thus it should be calculated 
for each star and for a given instrument. This has been done by 
[Michel et al.| ( |2009] l for the CoRoT mission and should be done 
for the Kepler mission as well. Finally, we can plot the bolomet- 
ric amplitude per radial mode in the frequency range found in 
the package PRMS, which is shown in Fig.|5] 

From now on, when we talk about the maximum amplitude 
of the mode, we will refer to the rms maximum amplitude. 

2.5. First Determination of the Mass and the Radius 

We assume here that we do not have any information on the ra- 
dius and the mass of the star beforehand, which is usually the 
case. Instead of using the scaling laws (Kjeldsen & Bedding 
1995 Bedding & Kjeldsen|2003j ) to estimate (Av) and v^ax^ we 
use them as input. We solve the two equations with two unknown 
variables: the radius of the star R' and its mass M'. We can cal- 



culate them according to formulae (9) and (10) from ( Kjeldsen 
& Bedding! 1995] ). 
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(7) 



(8) 



Concerning the uncertainty on these values, we have to take 
into account the error bars on (Av), Vmax- This leads us to uncer- 
tainties of about 10% and 20% respectively for the radius and the 
mass. These results depend on the validity of the scaling laws. 
It has been recently shown, thanks to the CoRoT mission, that 
they are closely followed for red giants ( Hekker et al.|2009b de 
Rid der et al.||2009] ) and one of the first goals of the asteroseis- 
mic investigation of the Kepler mission will be to check their 
validity. 

This package gives a first estimation of the mass and radius 
while the package described in the next section to estimate the 
radius is more accurate but more time consuming. 



2.6. Guessing Parameters and variation of the large 
separation with frequency 

In this section, we describe how the guess-parameter table is 
calculated, which is needed to perform the peak fitting of the 
individual modes. The guess frequencies thus obtained provide 
a first estimation of the frequency of each {(, ri) mode and the 
large-separation variation with frequency. 

The power spectrum is corrected from the background ob- 
tained by the package BF and smoothed by a factor of 2% of 
the estimated mean large separation (Av) (Sect. 2.2). The guess 
table is constructed over the frequency range [/miru/max] where 
p-mode power excess was found (see Sect. 2.2). The frequency 
bin of the highest peak in the power spectrum around the es- 
timated v,„ax (Sect. 2.4) is found. Then, the consecutive mode 
frequencies Vg_,-ti are determined by searching for the highest 
peaks within frequency windows of (vg ,- ± Av,) ± 0.15Ay,-, where 
the large separation Av, is continuously updated based on the 
previously measured frequencies. 

The guesses of the Lorentzian-mode amplitudes are calcu- 
lated from the maximum power corresponding to each guess fre- 
quency. 

A first estimate of the frequency dependence of the large sep- 
aration Ay(y) is thus obtained. An example for 30 days of a sim- 
ulated asteroFLAG star is shown in Fig.|6] 
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Fig. 6. Frequency dependence of the large separation of the 
group of even modes I - Q - 2 measured from 30 days of one 
simulated asteroFLAG star (Boris). The triangles represent the 
input values, and the circles the results from the pipeline. 
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Fig. 7. The fitted masses and ages for a simulated star The filled 
circles show the masses and ages derived from our fitting proce- 
dure starting from eight different initial guess values. The input 
value is denoted by the square. 



3. Radius from a grid of stellar models 

Once we have robust estimates of (Av), /min and /^ax (see 
Sect. 2.2), we couple this information with the available atmo- 
spheric parameters to estimate the radius of the star using stellar 
models. In the specific case of Kepler data, the atmospheric pa- 
rameters that we use and that are available to us from the KIC 
are \ogg, [M/H], and Teff. To determine the global parameters, 
we compare the observations ((Av), log^, [M/H], and T^ff) with 
model observables from stellar evolution and pulsation codes. 
We use the Aarhus Stellar Evolution Code (ASTEC) coupled to 
an adiabatic pulsation code (ADIPLS) ( |Christensen-Dalsgaard| 
|2008 b a). These codes need as input the stellar parameters of 
mass, age, chemical composition and mixing-length parameter, 
and return the stellar model interior profiles, global parameters, 
such as radius and effective temperature, and the frequencies of 
the oscillation modes. 

The parameters that best describe the observables are ob- 
tained by minimising a function; 
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Fig. 8. The residuals between the observations and model ob- 
servables for four of the models from Fig|7] From left to right, 
the observables are: T^g, logg, [M/H], and Av. 



X' 



(yi - Bi 



(9) 



where y and B are the / = 1,2, ...,M observations and model 
observables. The Levenberg-Marquardt algorithm is used for 
the optimization, and this incorporates derivative information to 
guess the next set of parameters that will reduce the value of 
X^. Naturally, an initial guess of the parameters is needed and 
these are obtained from a small grid of stellar evolution tracks 
(Tfif{,logg) where we assume (Av) is proportional to the mean 
density of the star. 

Because there are few observations and just as many pa- 
rameters, there are inherent correlations between mass, age, and 
metallicity. To help avoid local minima problems, we minimise 
the;^^ function beginning at several initial guesses of the param- 
eters (mainly varying in mass and age), and these initial guesses 
are estimated from the grids. For each star, we therefore obtain 
several sets of parameters with a corresponding value that 
match the observations as best as possible. To highlight the cor- 
relations we obtain, Fig.|7]shows several values of mass and age 
that adequately match the observations of a simulated star The 
square symbol highlights the true input values of these models. 



Fig. [8] shows the residuals (observation - observable) from 4 of 
the best-fitting models. It can be understood quickly from these 
figures, that there are difficulties with extracting both the mass 
and the age of the stars. 

Luckily, the radius parameter is highly dependent on the set 
of observations. This means that although we obtain quite a scat- 
ter in mass and age for each of the simulated stars, we do obtain 
robust estimates of the radius. 

The radius is calculated as the average value from the best- 
fitted models where the value is below 3.9^. The error is de- 
fined as one sixth of the difference between the maximum and 
minimum radius. The value of six stems from the "error" being 
defined as half of the scatter in values, and then we divide by 3 
to produce an estimate of the 1 cr uncertainty, assuming that the 
range of values spans 3.9 cr. The choice of dividing by 3 instead 
of 3.9 is to account for the low number of models that we are 
using to estimate the paramaters. 
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4. Results 

4.1. Simulated data 

In this work, we have used simulated data generated within the 
AsteroFLAG team for hare-and-hounds exercises ( Chaplin et al. 
|2008 a) dedicated to prepare the Kepler mission. We have also 
applied our pipeline to data from the Aarhus simulator ( Stello 
|et al.|2004j . These simulated stars are based on the information 
available in the KIC of the stars that will be first observed by 
Kepler during the survey phase. 

4.1.1. AsteroFLAG hare and hounds 

We have decided to work on three stars (or "cats") simulated by 
the AsteroFLAG team: Arthur, Boris, and Pancho. For each of 
them, we have 2 different rotation rates (low and high), 3 angles 
of inclination (0, 30, and 60°), and 4 magnitudes (from 9 to 
12). From these 72 simulated stars with different fundamental 
parameters, 2592 independent spectra of one month have been 
obtained (36 for each star). These 2592 spectra have been 
analysed independently with the methods described above 
for the package PMRS and the results have been compared 
following our cross-checking methodology (Sect. 2.2.3) for 
estimating (Av). 

Table [T] shows the statistics of the common results found for 
all the 2592 spectra according to the type of star. We can see 
that by exchanging our results in steps 2 and 3, the number of 
common results increases. For a given step, if we have a high 
number of common stars with a confidence level of 0, it means 
that the common results could be incorrect. This is due to the 
fact that if there is some signal (real or not) and if both methods 
search around the same region, they should find almost similar 
results. 

4.1.2. Aarhus/asteroFLAG simulator stars 

We have also tested our pipeline on 176 stars simulated with 
the Aarhus/asteroFLAG simulator These stars have magnitudes 
from 7 to 11 and they are one-month time-series. 

Once again we have analysed the common results for the dif- 
ferent steps of our methodology of the PMRS package (Table 1). 
For step 1, we have 33 results in common with 3 results with 
level 0. For step 2, we have 75 results in common and 4 level 0. 
Finally, for step 3, we have 78 common results with 36 level 0. 

4.1 .3. Strategy to select reliable results 

The strategy adopted in this work is to look at the number of 
common results in each step described previously and among 
them, to take into account the number of results with a confi- 
dence level of 0. We have to make a trade-off to have the high- 
est number of common stars with the lowest number of zero- 
confidence level stars. 

We recap here how we select the best step. First, we remove 
the step where the number of stars with a confidence level of 
is the highest. Then, we choose the step with the highest number 
of common stars and lowest number of stars with confidence 
level. We select only the results where the confidence level is 
equal to 1 and 2, so we subtract the number of results with a 
zero-confidence level. Finally, we add up the stars found in the 2 
best steps as all the common results might not be included in the 
common results of the other step. 



By comparing the results to the input, a tolerance of 5% of 
the real value for (Av) is accepted. If we are within this error, we 
say that the result is "correct". 

For instance, for the case of Pancho (Table[T]), we notice that 
for steps 1 and 2, the number of confidence level of is quite 
close: 6 and 9. But the number of common results is higher for 
step 2, 259 compared to 231 for step 1. So we choose the com- 
mon results of step 2. When we remove the results with a con- 
fidence level of 0, this gives 250 confident results. By applying 
it to all the stars, we select: 25% of the common results with 
Arthur, 34% for Boris, 29% for Pancho, and 40% for the Aarhus- 
asteroFLAG simulated stars. Among them, we know that there 
are respectively 9%, 2%, 11%, and 6% of incorrect results. 

Finally, according to the last step of our methodology, if we 
add the stars that are in common in step 1 in the case of Pancho 
and that do not appear in the common stars in step 2, we obtain 
293 stars for which we are confident. Thus we would have 29% 
of common results for Arthur, 38% for Boris, 34 % for Pancho, 
and 40% for the Aarhus-asteroFLAG simulated stars, with re- 
spectively 16.8%, 2.8%, 13.3%, and 6% of incorrect results. 

Now, if we look at the results in terms of magnitude (Table|2] 
first half) we notice that before applying the last step, for a mag- 
nitude V=9, we manage to have 87% of common results, with 
only 0.5% of incorrect results. Then, for V=10, the number of 
common results decreases to 25.6% with 10.8% of bad deter- 
minations. Finally, with higher magnitudes, the results are not 
convincing as we have more than 50% of wrong determinations. 
So we can say that our pipeline works well for stars with V<1 1. 

After applying the last step (Table|2] second half), the results 
are the following: for V=9, we have 91.5% of common results, 
with only 0.7% of incorrect results, for V=10, we have 33.3% of 
common results with 14.3% of bad determinations. 

For the simulated stars of Aarhus/asteroFLAG, this strategy 
leads to 71 common results out of 176 (40%), among which 
67 results are correct. Applying the last step does not change 
the result. Fig. |9] represents the distribution of correct results in 
terms of magnitude, T^ff, and log g. We notice that for magni- 
tudes lower than 8, 75% of the stars are retrieved. For 8<V<9, 
we have a good estimation of (Av) for 63.1% of the stars, for 
9<V<10, 48.9% of the stars, and for V>10, 27.1% of the stars. 
These results are quite close to what we have obtained with the 
cats. 

We have applied the other packages, BF and BALO, to these 
67 stars and estimated the maximum amplitude per radial mode. 
Fig. 10 shows the difference between the maximum amplitude 



estimated (using a smoothing of 3xAv) and the input. We clearly 
notice that for most of the stars, we underestimate the ampli- 
tude per radial mode. This can be due to the smoothing of the 
spectrum and maybe to the background fitting. We notice that 
the Gaussian fit is most of the time below the p-mode envelope. 
There is not any correlation between the correct estimation of 
the amplitude and the magnitude of the star (Fig. 10 left panel), 
or with the position of the maximum amplitude in frequency 
(Fig.~ 



10 



right panel). For 73% of the stars, we manage to re- 
trieve the maximum amplitude per radial mode within 3 cr. We 
have also calculated these amplitudes with different values for q. 
The bias varies between -10.8 +9% and -14.6 ±8%. For instance, 
with a smoothing of IxAv, we obtain that 81 % of the results are 
within 3 cr. 

For a few tens of the survey stars (squares in Fig.|9]l, we have 
calculated the radius following the method described in Sect. 3. 
Fig.|9]shows the models that converged with our method (green 
squares) as well as the models that do not converge (red squares). 
We selected only a few tens of stars for time and computer lim- 
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Fig. 10. Relative difference between the maximum amplitude calculated with the A2Z pipeline (using q=3) and the input values of 
the Aarhus-asteroFLAG simulated stars as a function of the magnitude (left panel) and the frequency of the maximum amplitude 
(right panel). We overplotted the error bars for each estimation. 
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Fig. 9. Distribution of the correct results obtained with the sim- 
ulated stars from the Aarhus-asteroFLAG simulator in term of 
V magnitude, Teff, and log g. The open circles represent the 
176 simulated stars. The full black circles represent the stars for 
which the correct value of the large separation was obtained. The 
squares are the stars selected to determine the radius: the green 
squares are the converging models and the red squares are the 
diverging models. 



itations. The selection was also done such as to have a panel of 
different stars, specially with a wide range for the mean large 
separation. The results are given in Table 3. We also give the dif- 
ference between the true input value and the value obtained with 
the pipeline and among 18 stars, we obtain the radius within 
3 cr for 13 stars. Fig. 1 1 shows the input values of the radius for 
the simulated data versus the fitted values and their errors bars 
obtained with the pipeline. The continuous line is the x=y line. 




1.1 1.2 1.3 

Fit Radius (Rg) 



1.5 



Fig. 11. The true radius versus the fitted input radius for a se- 
lection of the simulations. The continuous line is the x=y line. 



The figure shows that the determination of the radius from the 
pipeline is a rather good determination of the true input values 
within a few%. We note that there are some stars that were not 
estimated, and the stellar models most likely did not converge 
due to unphysical sets of parameters as input. Some more work 
is needed in the code, so that the automatic pipeline will give 
results for > 95% of the stars. At the moment the success rate is 
-70%. 



4.2. Solar data 

In this section, we have analysed solar data to test our pipeline 
on some real data. We have estimated the solar surface rota- 
tion with the wavelets. We could not use intensity measurements 
from the VIRGO instrument aboard SoHO as the data available 
are filtered above 1 and 10 days, which prevents us from measur- 
ing the rotation period. Therefore, we have applied the wavelets 
to velocity data obtained during the first year after the launch 
of the GOLF (Global Oscillation at Low Frequency) instrument 
(Garcia et al. 2005). We have rebinned the data to have one point 
every 2 hours. We have also applied a filter to remove periodic- 
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ities above 50 days. First we did a simple fast Fourier transform 
to calculate the PSD (Fig.[T2|. We notice four high peaks below 
1 yuHz, at 0.41, 0.8, 1.2, and 1.6 /kHz, which can be attributed 
to the rotation period and a high peak at 0.1 fiHz. However, we 
cannot disentangle the fundamental period. 

>> 
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Fig. 12. Low-frequency part of the PSD obtained with one year 
of solar data from GOLF instrument as a function of the fre- 
quency. The dotted lines highlight the highest peaks below 1 yuHz 
that represent the rotation period and the first harmonics. 
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Fig. 13. Left panel: Wavelet power spectrum as a function of 
time and frequency of the Morlet wavelet. The period is shown 
on the right axis of the plot. The green grid represents the 
cone of influence delimiting the reliable periodicity. Black and 
red colours denote high-amplitude power whereas green and 
blue colours highlight low-amplitude power. Right panel: Global 
wavelet power spectrum as a function of the frequency of the 
wavelet. The dashed line represents the threshold for a 95% con- 
fidence level. 
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We have then applied the wavelets to these data (Fig. 13 1. We 
notice that there is an accumulation of power along time around 
0.4-0.5 juHz. The COI (green grid) shows that a frequency below 
=s0.2 yuHz is not reliable, rejecting thus the peak at 0.1 fiHz. The 
Global Wavelet Power Spectrum, which is a collapsogram over 
time, shows a maximum at 27 days (0.42 fjHz). The width of the 
peak is related to differential rotation visible at the surface of the 
Sun. We can also see a peak around 0.8 yuHz, but in the WPS 
there is a power excess only for a short period of time, telling 
us that this is not the fundamental peak. Thus, this tool is very 
powerful as it enables us to recover without any ambiguity the 
real rotation period at the surface of the Sun. This was confirmed 
using several sets of simulated data as well as solar data obtained 
with GOLF instrument. 

We have used the package PMRS to estimate the mean large sep- 
aration for the Sun. We have applied it to the VIRGO data, by 
taking series of 30 days of the blue SpectroPhotoMultiplier, to 
simulate what we will have with the survey data of Kepler. Using 
Method 1, we have found <Av) = 135.46 ±3.3 yuHz. As the SNR 
is quite high, it is obtained with a window of 600 //Hz. There 
is no double bump. This is also obtained with the highest peak 
in the PS2. Finally, the p-mode excess power is found between 
2039 and 4919 //Hz (Fig. 14 1. All these values have a confidence 



level of 95% and completely agree with the well-known oscilla- 
tion parameters of the Sun. 

Fig. 15 shows the background fit on the PSD. We have sub- 



tracted it from the original PSD and we have calculated the max- 
imum amplitude per radial mode with the package BALO. For 
the instrumental response of VIRGO, we have used the response 
function given in [Michel et al.| ( |2009[ ). This gives the ampli- 



o 
> 



20 



15 



10 



a 



95.%. 





2000 4000 6000 

Frequency (/itHz) 



8000 



Fig. 14. Maximum relative power for one month of the VIRGO 
data of the Sun from package PMRS. Same legend as Fig.|2] The 
excess of power of the p modes is in the range 2039 to 4019 fiHz. 



tude envelope plotted on Fig. 16 (black curve). Thus, we obtain 
Amax=2.99 ±0.13 ppm and v,„ax= 3082 ±1.08 //Hz (Table 4). 
Top panel of Fig. 17 shows the variation of the large sepa- 



ration with frequency. Our results are compatible with those of 
Frohhch et al.| ( fT997l i. 

The result for the radius of the Sun is given in Tables 5 and 
6 and we see that it is fully retrieved within the error bar. 



S. Mathur et al.: Determining global parameters of the oscillations of solar-like stars 



11 




The frequency dependence of the large separation extracted 
(Sect. 2.5) is illustrated in Fig. 17 The large separations 
thus measured were compared with published frequency tables 
dGarcia et al.l |2Q09; Ba rban et al . 2009; B enomar et al.|[2009] l. 
They provide a qualitative good first estimates of their frequency 
dependence. 



Sun (VIRGO) 



Fig. 

one 



1.0 10.0 100.0 1000.0 10000.0 

Frequency (/xHz) 



15. Background fitting (green curve) of the solar PSD for 
month of data from VIRGO. 



4.3. Solar-like oscillating stars observed by CoRoT 

We have applied our pipeline to three targets of the CoRoT mis- 
sion: HD49933 (Appourchaux et al. 2008 ; Benomar et al.|2009 



HD 18 1906 ( Garci a 'et ai:||2009K and HD181420 ( |Barban et a 
[2009) . For HD49933, two runs are available: (1) 60 days and (2) 
137 days. For HD181906 and HD181420, the data sets are 156 
days long. 

The results of the pipeline are summarised in Tables 4, 5, 
and 6. The large separations and the p-mode range were obtained 
with the Method 1 of PMRS (Table 4). They all agree with re- 
sults already obtained in published papers. Fig. 16 shows the 



bolometric amplitude in the frequency range where the p-mode 
bump was found for all these stars. 
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Fig. 16. Smoothed bolometric amplitude per radial mode (rms) 
for the Sun, HD49933 (run 1 and 2), HD181906, and HD 18 1420. 
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Fig. 17. Frequency dependence of the large separation obtained 
as in Sect. 2.5 in the case of the Sun (by using 30 days of VIRGO 
observations) and three COROT stars: HD49933, HD181420, 
HD181906 (circles). The results based on published frequencies 
are also shown (triangles). In the case of HD49933, the squares 
represent the results obtained from the second run of observa- 
tions. Only the group of even modes ^"=0-2 are represented. 
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With the package FDRM, we have estimated the mass and 
radius of the stars (Table|5]). Within the error bars, they are com- 
patible with the data we have with spectroscopic observations. 
So we can have a rough estimation of the mass and radius if we 
determine (Av), and the frequency range of the p modes. 

Using the package RadEx, we have estimated the radius 
of the CoRoT stars (Table 6). For HD49933, we find a ra- 
dius of R=1.367 +0.016 Rq, while from spectroscopic mea- 
surements, R=1.34 +0.016 Rq. For HD181906, we obtain 
R=1.415 ±0.016 Re, to be compared to R=1.392 ±0.054 R^. 
Finally for HD 18 1420, spectroscopic observations have given 
R= 1.595 ±0.032 R^ we obtain R= 1.623 ±0.026 Rq. All of these 
values are in agreement. 

5. Discussion and conclusions 

We have described all the different packages of our A2Z pipeline 
to determine the global parameters of stellar oscillations: the 
mean large separation, the frequency range of the power excess 
due to the p modes, the background, the maximum bolometric 
amplitude per radial mode, the guess table of frequencies, and to 
obtain the radius and the mass of the star. 

The first most important step of the pipeline is the estima- 
tion of the large separation. We have adopted a 3-step strategy by 
comparing Method 1 and Method 2 of the package PMRS and 
by interchanging the results of each of them. We have selected 
the common results of the step where we obtain the highest num- 
ber of coincidences with the lowest number of zero confidence 
level. Applying this methodology to simulated stars has showed 
that we have obtained from 25 to 40% of common results with 
errors from 2 to 11%. An optional step consists in adding up the 
common results that are in the 2 best steps. This has led to 29 to 
40% common results with errors ranging from 2.8 to 16.8%. 

In term of magnitude of the stars, our pipeline seems to be 
more robust with stars of magnitude below 1 1 . 

It is not yet decided if the last step of our strategy, which 
adds up the common result from the two best steps, will be used 
in the future as the number of common results does not increase 
necessarily and the number of errors increases. It will mainly 
depend on the the data that we will have with Kepler 

For the estimation of the mode amplitude, we have managed 
to estimate 73% within 3 cr of the amplitudes for the Aarhus- 
asteroFLAG simulations. 

Concerning the estimation of the radius of the simulated stars 
using stellar models, we have succeeded in calculating 70% of 
the selected simulated stars. 

Using the data of the Sun and solar-Uke targets of the CoRoT 
mission, we have retrieved the global parameters already pub- 
lished and we have also estimated the radius and the mass of the 
stars with very good accuracy. 

By applying the A2Z pipeline to three CoRoT targets, 
we found for HD49933 <Av>=86.2 ^uHz, for HD181906, 
<Av>=86.53 fiHz, and for 181420, <Av)=75.35 fiHz. 

We were able to estimate the maximum amplitude per radial 
mode for these stars: from 3.9 to 4.08 ppm for HD49933 (de- 
pending on the run), 2.85 ppm for HD181906, and 3.89 ppm for 
HD191420. 

We estimated roughly the mass and the radius of these 
CoRoT targets with the FDRM package, which are, within the 
large error bars, compatible with the published values from spec- 
troscopic observations. The radii retrieved from stellar mod- 
els are in agreement with published results: for HD49933, 
R=1.367 ±0.016 Re, for HD181906, R=1.415 ±0.016 Rq, and 
for HD 18 1420, R= 1.623 +0.026 R^. 



We have to keep in mind that our codes have been modified 
and tuned to give the best results with the simulated data that we 
analysed. However, we might have to tune them again with real 
data from Kepler So they are not unchangeable and they might 
evolve in the next months. 

We have already applied this pipeline to one month of Kepler 
data and obtained some promising results with solar-type stars 
and red giants ( ,Bedding et al.,2010, ,Chaplin et al.|2010 Stello 
|etal.|2010D . 
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Table 1. Number of common results for the asteroFLAG and Aarhus/asteroFLAG stars, for the 3 steps of our strategy and the 
corresponding number of results with a confidence level. 



Star 


Step 1 


Level 


Step 2 


Level 


Step 3 


Level 


Arthur 


182 


11 


223 


10 


294 


62 


Boris 


300 


5 


305 


11 


531 


135 


Pancho 


231 


6 


259 


9 


310 


273 


Aarhus/asteroFLAG 


33 


3 


75 


4 


78 


36 



Table 2. Number of common results for all the steps of our strategy with the errors (incorrect results) as a function of the magnitude. 
The second half of the table shows the results when we add the connmon results of the two best steps. In the parentheses, we put the 
percentage of incorrect results. 



Star/V 


9 


Error 


10 


Error 


11 


Error 


12 


Error 


Arthur 


176 


2 (1%) 


25 


5 (20%) 


5 


5 (100%) 


7 


7 (100%) 


Boris 


209 


(0%) 


89 


2 (2%) 


3 


1 (33%) 


2 


2 (100%) 


Pancho 


179 


1 (0.5%) 


52 


11 (21%) 


10 


8 (80%) 


9 


8 (89%) 


Arthur 


182 


2(1%) 


39 


14 (36%) 


14 


12 (85%) 


14 


14 (100%) 


Boris 


214 


(0%) 


107 


3 (3%) 


7 


2 (28%) 


4 


4 (100%) 


Pancho 


197 


2 (1%) 


70 


14 (20%) 


13 


11 (84%) 


13 


11 (84%) 



Table 3. Radius (Rp) from stellar modelling for some Aarhus/asteroFLAG simulated stars compared to the simulation input (Rm)- 



KICID 


Rp (Re) 


cr(R) 


(^o) 




3648936 


1.454 


0.013 


1.393 


4.692cr 


4041406 


1.032 


0.008 


1.040 


l.OOOcr 


4757931 


1.416 


0.013 


1.424 


0.615cr 


4861991 


1.340 


0.021 


1.378 


I.8IO0- 


5790807 


1.540 


0.020 


1.559 


0.950a- 


6129877 


1.014 


0.008 


0.986 


3.5OO0- 


6307459 


1.368 


0.013 


1.379 


0.846O- 


8349612 


0.695 


0.000 


1.268 




8745924 


1.264 


0.022 


1.252 


0.545O- 


9204313 


1.133 


0.007 


1.120 


1.857cr 


10644253 


0.923 


0.014 


0.954 


2.214cr 


10454113 


1.253 


0.018 


1.320 


3.722cr 


3221671 


1.277 


0.013 


1.289 


0.923O- 


5295670 


1.466 


0.016 


1.404 


3.8750- 


6285677 


1.338 


0.022 


1.323 


0.682(7- 


6306896 


1.402 


0.005 


1.405 


O.6OO0- 


8561664 


1.073 


0.014 


1.083 


0.7 14o- 


10454113 


1.253 


0.018 


1.320 


3.7220- 



F=fitted; M=model 

Table 4. Pipeline results for real data: rotation period, mean large separation, frequency-range of the p modes, position of maximum 
amplitude, and bolometric maximum ampUtude per radial mode. 



Star Period (days) (Av> (uHz) c QjHz) (uHz) /^ax (;^Hz) v^ax (pHz) e (//Hz) A^ax (ppm) e (ppm) 



Sun 


27.8 


135.46 


3.26 


2040 


4920 


3074.7 


1.02 


2.99 


0.13 


HD49933 (1) 


3.2 


86.14 


1.58 


720 


3060 


1699.88 


1.05 


4.08 


0.15 


HD49933 (2) 


3.3 


86.33 


1.73 


720 


3000 


1804.34 


0.28 


3.90 


0.08 


HD181906 


2.8 


86.53 


1.69 


1200 


2460 


1896.9 


1.2 


2.85 


0.13 


HD 18 1420 


2.6 


75.35 


1.53 


480 


2760 


1536.26 


0.42 


3.89 


0.1 
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Table 5. Estimation of the radius (R ') and the mass (M') of real data with the package FDRM compared to spectroscopic observations 
(R(obs)andM(obs)). 



Star 






R (obs) (Re) 




e(Mo) 


M (obs) (Mo) 


Sun 


1.003 


0.1 


1 


1.0167 


0.2 


1 


HD49933 (1) 


1.483 


0.15 


1.34±0.016 


1.3279 


0.26 


1.2 


HD49933 (2) 


1.567 


0.15 


1.34±0.016 


1.5741 


0.35 


1.2 


HD181906 


1.581 


0.16 


1,392 ±0.054 


1.6231 


0.32 


1.144 ±0.119 


HD181420 


1.7255 


0.17 


1.595 ±0.032 


1.600 


0.32 


1.43 ±0.05 



Table 6. Radius and fitted parameters for real data from stellar modelling (package RadEx). 



Star 


Rf (Re) 


<r(R) (Re) 


reff,F (K) 


<Av)h (^Hz) 


<Ay>o OuHz) 


':<Av>Ob, (/'Hz) 


Sun 


1.016 


0.023 


5879 


135.46 


134.67 


3.26 


HD49933 


1.398 


0.040 


6835 


86.14 


84.60 


1.58 


HD181420 


1.613 


0.032 


6596 


75.35 


76.43 


1.53 


HD181906 


1.420 


0.018 


6294 


86.53 


86.12 


1.69 



F= fitted; O = Observed 



